3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
<Not Specified>
Size:
1.8 million sentences Production Status:
Existing-used
Use:
<Not Specified>
-
Paper title:Jointly Learning to Embed and Predict with Multiple Languages
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Tuesday
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daniel C. Ferreira | Priberam, Instituto Superior Técnico | PT |
| Author 2 | André F. T. Martins | Priberam, Instituto de Telecomunicacoes | PT |
| Author 3 | Mariana S. C. Almeida | Priberam / Instituto de Telecomunicações | PT |
| Main Contact | Daniel C. Ferreira | Priberam, Instituto Superior Técnico | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Japanese
Availability:
Freely Available
License:
OpenSource
Size:
3200000 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:JESC: Japanese-English Subtitle Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Reid Pryzant | Stanford University | US | ||
| Author 2 | Youngjoo Chung | Rakuten Institute of Technology | JP | ||
| Author 3 | Dan Jurafsky | <Not Specified> | None | Stanford University | US |
| Author 4 | Denny Britz | JP | |||
| Main Contact | Reid Pryzant | Stanford University | None |
Documentation:
Public documentation will become available upon official release.Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
84 MByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus
-
Paper track:Evaluation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Shabnam Tafreshi | The George Washington University | US |
| Author 2 | Mona Diab | GWU | US |
| Main Contact | Shabnam Tafreshi | The George Washington University | None |
Documentation:
<Not Specified>
Speech
Corpus,
Language Type:
Multilingual
Languages:
English Hindi
Availability:
From Owner
License:
<Not Specified>
Size:
43 MByte Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
-
Paper title:A Hindi-English Code-Switching Corpus
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Anik Dey | HKUST | HK |
| Author 2 | Pascale Fung | <Not Specified> | None |
| Main Contact | Anik Dey | HKUST | None |
Documentation:
Documentation available upon request
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
406 sentences Production Status:
Newly created-in progress
Use:
Evaluation/Validation
-
Paper title:CEFR-based Lexical Simplification Dataset
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Satoru Uchida | Kyushu University | JP |
| Author 2 | Shohei Takada | Osaka University | JP |
| Author 3 | Yuki Arase | Osaka University | JP |
| Main Contact | Satoru Uchida | Kyushu University | None |
Documentation:
Readme.txt is included in the zip file.
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
open source
Size:
18628 words Production Status:
Newly created-in progress
Use:
Discourse
-
Paper title:Corpus Resources for Dispute Mediation Discourse
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mathilde Janier | School of Computing | GB |
| Author 2 | Chris Reed | University of Dundee | GB |
| Main Contact | Mathilde Janier | School of Computing | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
non-commercial
Size:
1224 exam scripts OtherProduction Status:
Existing-used
Use:
Learner Language Analysis
-
Paper title:Grammatical Error Annotation for Korean Learners of Spoken English
-
Paper track:General issues
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Hongsuck Seo | Postech | None | ||
| Author 2 | Kyusong Lee | Postech | None | ||
| Author 3 | Gary Geunbae Lee | Department of Computer Science and Engineering, POSTECH, Pohang, South Korea | N/A | Postech | None |
| Author 4 | Soo-Ok Kweon | Postech | None | ||
| Author 5 | Hae-Ri Kim | <Not Specified> | None | ||
| Main Contact | Hongsuck Seo | Pohang University of Science and Technology | KR |
Documentation:
The documentation is available in English on their website.
Not Applicable
Named Entity Recognizer,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Gnu
Size:
66 MByte Production Status:
Existing-used
Use:
Person Identification
-
Paper title:Exploring the utility of coreference chains for improved identification of personal names
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Andrea Glaser | University of Stuttgart | DE | Institute for Natural Language Processing, University of Stuttgart | DE | ||
| Author 2 | Jonas Kuhn | Universität Stuttgart | None | University of Stuttgart | DE | Institute for Natural Language Processing, University of Stuttgart | DE |
| Main Contact | Andrea Glaser | University of Stuttgart | None |
Documentation:
http://nlp.stanford.edu/software/CRF-NER.shtmlLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
various Creative Commons
Size:
64 GByte Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:BioC and Simplified Use of the PMC Open Access Dataset for Biomedical Text Mining
-
Paper track:<Not Specified>
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rezarta Islamaj | National Center for Biotechnology Information, National Library of Medicine | US |
| Author 2 | Don Comeau | National Center for Biotechnology Information, National Library of Medicine | None |
| Author 3 | John Wilbur | National Center for Biotechnology Information, National Library of Medicine | None |
| Main Contact | Rezarta Islamaj | National Center for Biotechnology Information, National Library of Medicine | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
75000 Production Status:
Existing-used
Use:
Word Sense Disambiguation
-
Paper title:Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daisuke Kawahara | Kyoto University | JP |
| Author 2 | Martha Palmer | University of Colorado | US |
| Main Contact | Daisuke Kawahara | Kyoto University | None |
Documentation:
<Not Specified>




